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Summary 
This 2roaduct Functional Specification describes a new 


GC9S %3 nulticomputer system desian for an information 
processiivjg conplex with siqnificantly increased availability 


and capazity which may he localty or aeograndhically 
distribut2d,. The multicomouter system design 1S an 
avyolicatian of the Distributed Systems Architectures 


a9ffering tightly coupled multicomputer systems support and 
local network capabilities. ALl multicomputer capabilities 
described in this PFS apply to GCOS 8&8 and are candidates for 
jaclausioxr ain other operating systems as the marketing 
requiremerits dictate. 


The nulticomouter system desian provides increased 
. availability and extensibility by making multiple computer 
“systa2ns ardpear to the end user to be ae single comoutina 
resource. Tne user visibility of an individual computer 
witrin tn2 conplex will not be areater than that of an 
1ydividual: processor within a traditional multiprocessor 
corfiaquration. Ahilte the multicomouter comolex provides the 
apoearanc2> of a single systems it is actually made up of 
Semi-autd10N04uS computers each with it's own operating 
Syst®t, cro0derating at a level which creates the perspective 
of a unified resource. This orovides both hardware = and 
software redundancye contributing to system availability. 
The nulticonputer complex can he configured so that the 
availability -of the total system will be uninterrupted by 
ary single-point hardware or software failure. The 
dsolatior provided by the multicomputer system desian will 
also contrisute to availability by timiting the effects of 
ary failure to a portion of the total complex. System 
isolation also provides enhanced extensibility. Transaction 
processing and timesharina loaon workstationss and a aqlobal 
batc4 scieduler eliminate the requirement for end user 
knowledge of the configuration. In additions an attemot iis 
made to. distribute the workload judiciously among the 
comouters in the comolex. System transparency is enhanced 
by shariig resources between computers. The multicomputer 
Syst?m 428199 provides the framework for incorporating 
functionally dedicated processors into a multiconputer 
comolex, For examples a database machine could be 
construct2d from standard hardware and software products, 
ayd incorsorated into the comolex utilizing the high level 
data interface and the location independent storage 
managemert casabilities currently being specified as pnart of 
DSA, 
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Certain aoplications (Ce.q. word processing and araphics 
stations) require high data transmission rates and many 
jaterconrzcted computers and devices. The multicomputer 
systen d2sigr will orovide NASA hased Local network 
caoadilities for processors connnected by a high speed bus 
Cinter-conput2r connection). For examples word processing 
Stations could store and retrieve documents from a host 
systen's dataoase. The high speed inter-computer connection 
would facilitate the transfer of documents between stations, 
as w@2ltl as setween the host computer and the specialized 
stations. A Local network can be confiaured as part of a 
g2ographically distributed network. HIS systemss other than 
GCOS &s arid other vendor's systemss can be accommodated 
throuaqh 3a network job entry facility and through a file 
traisfer system. 
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To DAtPeauetion 


Recert irdustry and Parc trends indicate the necessity 
of oroviding locally distributed functional seoaration of 


processiyjgs extensible systemss and hiah availability 
systems. Tnis facts» alona with the realization that 
Distributed Systems involve more than Networks and 
Comnunicationses has led to the conclusion that the 
Distribut2d Systems Architecture (DSA) should he used to 
prodive a new multicomputer capability. The current 


ca9ability. known as Shared Mass Storaqes which has served in 
a subset cadsdacitys 18 recognized as being inadecuate for the 
future, 3COS 8 now has the opportunity to provide this new 
mualticomouter capability to the userss which is aoverned by 
the OSA protocol, interface, and administrative 
soecifications.e 


The narket requirements for this new multicomputer 
cadability are contained in "LISD Market Requirements For a 
Maltarcomauter Comolex” (see document 3.19). 


1.1 Defirition of Wulticomputer 


Conosuter - One or more processors sharing main memory” along 
witn associated I/0 and oeripheral subsystems. 


Coanouter system -°- An information processing system 
consisting of a single computer and onerating system 
supogartirag application programs and databases. 


Multicomouter system —- An information processing system 
consistirg of two or more interconnected computer 
systems. The operating systems on the various 
comduters cooperate to provide a sinqte end user 
interface ain which the confiauration is typically 
traisparent to the end user. The essential 
characteristics of a multicomputer system are: 


- The nulticomputer system appears to the end user to be 
a sirigle computer system. 


- The zsomputers are typically located within ae limited 
area. Normally this is a single building or building 
comol ex. 


- The quicx exchange of information amona computers is 
critical and is tysically performed via high speed 
conn2ctions. 
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1.2 Objectives 


The following objectives have been established for the 
maltitcomouter system design to meet the requirements 
described i7 “LISD Market Requirements For a Multicomputer 
Comolex™ (see document 3.19). 


- The multicomputer system desian must increase the 
availasility of the total computina system. This does 
Not recessarily mean that the reliability of any sinale 
component will increases but the availability as viewed 
by tye total end user population must substantially 
increase over that of a sinale independent computer 
Syst2m. | 


- The nulticomputer system design must increase computing 
power through the Interconnection of multiple 
comouters. This does not mean that the comoutinga power 
seer dy an individual end user will tncreases hut that 
the sonputing capacity viewed by the total end user 
population can be substantially increased over that of 
a siigle independent computing system, 


- The nulticomputer system desiqn must ease the migration 
of G20S 8 users to new releases of GC9S & and to 
Cinstallation of) new computers. It is envisioned that 
the i intarfaces defined will be standardized and that 
eacn system of a multicomputer comp lex can be. 
individually and incrementally updated from one 
operating system and software release to another, 


- The multicomputer system desiqn must provide an 
evols tio to Locally distributed functional 
capadilities (Ce.g. snecialized processors). 


- The nulticomputer system design must provide a single 
uniforn user interface to the total computing conplex. 
That ise the user interface must he independent of the 
multicomouter configuration and the Location at which 
hardware and software services are performed. 


- The multicomputer system desiaqn must suodport the 
Intercaniection of other HIS and non-HIS computer 
Syst2ms in a single multicomputer complex. The 


objective is for GCOS & to support Network Job Entry, 
File Trarsfers, and cooperating user applications (for 
such systems. 


Multicomputer $895 3243 7 1 
Global PFS DRAFT 


2ef Business Stratagy 


Successful establishment of Honeywell as the leader in 
multicomouter technology 1s not only desirabte for PARC 
protectioyvs but It provides a vehicle for penetration of 
comoetitor's markets. It also furthers the adoption of OSA 
as an industry standard and reaffirms our leadership in 
Distribut2d Processing. 


The nulticomputer system desiqn described in this PFS 
provides: 


- Increased power for customers whose requirements exceed 
the Ccaogacity o f our largest tiahtly coupled 
multimorocessor systems. 


- Availasility aporoaching utility grade service (for 
customers whose business depends on non-interruotable 
operatioy of the computer complex and fast response, 


-- Non-4dIS nost connections for special ourpose machines 

 (Cdatabas2, jarray processorss etc.) or to permit the 
penetration of non-HIS customers in a highly efficient, 
coexisterit environment. 


- Fase of migration to new Honeywell machines and 
operating systems for customers who cannot risk major 
hardware or software chances In a production 


environnent. 


- Allows additional customer flexibility in the manner in 
whicr he can take advantage of the decreasing cost of 
proccessing power while retainina the advantaqes of 
centralized operation. 


" 2 
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4.9 Apolicable Documents 


NSA Reference Book ~- Volume 1 (General Description) 
APSE/NSA Product Set 1 PFS=S280532238 

Unified Network Control Processor PFS RxX-006 

Yvified ieeosk Anplicatian/Satettlite Processor PFS=-39R4B 
Unified File Transfer PFS=5S38N2NS63 


Praoaqrannatic Interface to Presentation Controls 


November 
23, 197) | 


Proocranmnatic Interface to Messace Managements Reve 4e July 
117 197) 

NSA Link Protocols, DSA-51 

DSA Network Protocols, DSA-52 

DSA Trarsoport Multiplex Protocols, DSA-53 

DSA Conrection Protocols, DSA-54 

NSA Dialog Protocol, DSA=-S5 

NSA Presentation Control Protocols DSA-56 

L465 = JVC? €Common) Exchange Interfaces ASA-57 
DSA Network Batch Protocols, DSA-40 

Remote rile Access Protocols DSA-41 


Renote File Access Protocol Logical File Transfer Subset, 


DS A-462 
DSA Metwork Control and Administrations DSA-70 


LISD Market Reauirements for a Multicomputer Complexs, LISD 
P & PM» Dec. 106 1979 


AN> Hyos2rvisor Software EPS-S5S8061027, R. Cartlistes Nov. 


129 197) 
Systems description: Hyperchannel Network Adapters, 
Network Systems Corporations Publication now AQ1-9000-02, 


Rey. 20 Jan 1978 
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4.0 Develosomert Strategy 


The develrdpmert strateay for the multicomputer system desian 
is to divide the multicomputer capability into a series of 
develoomert osahases (Cor steos) which can be implemented over 
a period »f several years. The subset to be implemented for 
eac? phase will he implemented usina the desian 
sxyecificationrs for the full multicomputer capability. 


The advaritages of the phased development ares: 
-- A phased develooment reduces the imolementation risks 
by reducing the numher of new functions in each 


rel23S59., 


- A phased development olan permits a short term release 
9f 3 subset of the multicomouter architecture. This 


will pernit early user visibility and feedback. The 
knowledge gained from this exposure will improve later 
phas2?s. 


- A oghased develooment plan permits a short term release 
of 3 Shared mass store equivalent subset of the 
multicomouter capability to meet the marketing 
requirement defined in “LISD Market Peauiremnents For a 
Mualticomoauter Comntlex”™ (Csee doc. 2.19). 


4.1 Phase 1 Develooment 


The asrinary purpose of Phase 1 18 to provide ae solid 
fourdatioara on which the comolete multicomputer functionality 
can be built. This phase has four major qoals: 


- FEstardlisriment of the multicomputer system conventions. 

- Implementation of the kernel functions o f the 
multicomnouter capability. 

- Implementation of a functional replacement for shared 
mass storaqe,. 

- Provide a customer release of the multicomouter 
capadvility to satisfy market requirements soecified in 
docunert 3.19. 


Phase 1 nust neet the fotllowina requirements: 
- Estaadlis 1 the conventions and standards for subsequent 
multicoamouter development. These will be based on the 


approdpriate PSA standards. 


+ Establish the multicomputer confiauration 
definition lLanquaqe which includes: 
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Definition of the computers in a complex. 


*« Definition of the Locations of system 
workstations. 
x Pefinition of peripheral devices and 


communications connections. 


Establish the multicomputer operator lanauaaqde. 
Establish the mailbox naming conventions for both 
network and multicomputer confiqurations. 
Establish the default mailbox names for both 
System and user workstations. 

Establish t he conventions for all system 
workstations. 


-- Provide the basic system functions (kernel) required 


for ¢ 


+ 
+ 


- Impl 2 
the d 


he full multicomputer development. 


Inplement the comolete NSA Session Control 
Programmatic Interface. 

Inplement functions necessary to create and 
Nitiate system workstations. 

Inplement recovery functions for system 
workstations. 

Inoplement startup and restart functions for 


malticonputer configurations. 
Inplement functions for user and system orocesses 
to se standard workstations. That is: 


x Administration of default mailbox names. 

ke Creation of deault mailboxes at process 
Initiation. 

x Loading and initialization of shared code 
(such as session control) early in startup so 
that it can be used by system workstations. 


Modify system processes to be system workstations 
and to use Session Control Programmatic Interfaces 
for iIinterprocess communications. These system 
processes include Scheduler, PALC, GEOT, GEIN, 
PIP“. TSS» and others. 


ment functional replacement for shared ee 
ge which must include support for: 


Mualticomputer shared disk drives and packs. 
Multicomputer shared disk files with access 
control at the file Level. 

Multicomputer shared input (local and remote). 
Multicomouter shared outout (Clocal and remote). 


ment Common Exchange Interface (CXT) support for 
IS Inter System Link CISL). 
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4.2 Phase 2 Develonoment 


The orinary ourpose of Phase 2? is to provide the 
capability. to run multiple system releases concurrently in a 
malticomoauter comolexs provide enhanced resource management 
of shared resources and provide enhanced availability. 


Phase 2? nust neet the following requirements: 


- Impl2ment multicomputer functions to the point where 
multiple operatina system releases can run 
concurrently. 


- Impl2ment enhanced resource manaaqaement o f shared 
resources to supports 


+ Multiple comnuter connectability (shared ophysical 
connection) tape drivess and unit record 
equipment. 

+ Srared and exclusive allocation of disk packs at 
the volume set level. 

+ Exclusive allocation of multiple computer 
coniected tape drives and unit record equioment. 

+ Data integrity control at the control interval 
level for disk files as specified by the DSA I/0 
architecture, 

- Impl2ment enhanced availabiltiy and recovery functions 
whicy suoports: 


+ User planned recovery of user workstations when a 
comouter fails. 

+ Integrated processina of Journals at the 
multicomputer complex level. 


- Impl2ment the multicomputer operator functions which 
provides a single operator interface to control the 
multicomouter complex. 


4.3 Phase 3 Develooment 


The prinary purpose of Phase 3 18 to achieve increased 
corfiguration indenendence. 


Phase 3 sriould meet the followina requirements: 


- Shar2d prysical connection of disk devices is no Longer 
a functional requirement. At this phase the use of 
shar2d pnysicat connections is only required to meet 
specific availability and performance reauirements. 


- Share 
be 
inde) 
multi. 


- Enhay. 


+ 
+ 


4.4 Phase 4 Dey 


The or 
different 
suooort $s) 


Phase 4 5) 


- Supo99 
fAro" 
and 
syst2 
requl 


- Supa 
Inter 
inter 
the ) 
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d logical connection of all relevent devices will 
Supported (1.@€. a computer can access device 
erdent of where it 1s confiqured in the 
computer complex). 


ced resource manager and scheduler which support: 
Scheduling based on resource requirements. 


Resource allocation and scheduling based on global 
resdurce utilization. 


Resource allocation and scheduling based on shared 


logical connection of devices. 


ced Media Management which orovides coordination 
dia at the multicomputer level. 


elooment 


inary purpose of Phase 4 is to Support multiple 
Qperating systems running concurrently and to 
ecialized processors. : 


ould meet the following requirements: 


rt running multiple operatina systems concurrently 
gi coexistence based on the DSA Network Job Entry 
File Transfer facilities. The following oneratina 
ms will be supported based on specific marketing 
rementss : 


Gcos & 
CP 5 
Multics 


Level-64 (not to be done by LISD) 
Level-6 (not to be done by LISD) 


rt for specialized processors such as? 


CO3S9L machine 
Fortran machine 


etc. 


rt a database machine throuah a non-procedural 
face that is much hicher than the record-at-a-time 
face. This interface is currently beina added to 
SA specification. 
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S$.0 Hardware Configuration 


The market requirements for the multicomputer system 
design call for a wide ranae of possible configurations to 
satisfy tre variations in customer anoplications. Further, 
it 1s expected that the ranae o f multicomputer 
configurations wilt reflect evolutionary changes taking 
place 11 #thea computer industry. Therefore, the hardware 
support for multicomouter nust be designed and implemented 
i” the nost openmended and flexible manner possible. This 
trcludess 


- The numoer and type of computers which can be 
configured in a multicomputer comolex. 


- The vunber and tyne of peripheral devices and their 
topology within the multicomputer complex. 


- The abtlity to configure all hardware redundantly for 
maxinun availability. 


5.1 Conouters 


The multicomouter system desian supports the 
Tyterconrectioan of computers of many different sizes and 
tyoes. f13Is includes: 


- single processor computers 
- tightly coupled multiprocessor computers 
- functionally specialized processors 
(2.36 database machine) 

- ALL 3C9S 8&8 compatible computers 

Ce.gevr ELSs DPS-E, ANP, CREA etc.). 
- Other HIS computers 

Ce.ger LOR L444, LA etc.). 
- noneilS comouters 


S.2 Inter-comosuter Connection 


The nulticomouter system desian requires that each host 
conouter ina multicomnuter complex he able to communicate 
wit? all other host computers in the complex (See section 
oon Since the PSA Session Layer standard does not inctude 
m2ssaqe switching in the host computers the inter-computer 
connectio2a nust allow each host computer in the comolex to 
comnunicate with every other host computer in the complex 
without 9o211g routed through an intermediate host. The 
physical characteristics of this connection (multidroo,s 
p2imt=-to-x,o0ints the distance between computerss the number 
o f computers and the speed of the connection) shall be 


ae 
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Linited orly oy the cost-effective technologies available. 
Ther? Nay be different imolementations of this 
7ater-conzuter connection to meet the requirements o f 
different multiconputer confiaurations (number of comouters, 
availability and performance requirements, database size and 
usages ett.)-s 


- The nirinum iInter-comouter connection immolementation 
must have the following characteristics: 


+ It nust be compatible with the DSA Common Exchanae 
Interface (CXI). 

+ It nust be high sneed (>= 5 meqabytes/sec.). 

+ It must be capable of connectinq up to four 
comouters In a multicomputer complex. | 

+ It nust be canable of connecting multiple HIS 
comauter tynes. 

+ It nust be capable of distributinag the comouters 
over a relatively long distance (>= 70 meters). 

+ It nust be capable of connecting multiple parallel 
data paths for increased performance and 
availability. 


- The naxinum inter~comouter connection implemented must 
have the following characteristics (these 
characteristic are similar to the MSC Hynerchannel; see 
docunent 3.21): 


+ It nust be compatible with the DSA Common Exchange 
Interface (CXI). 

+ It nust be high speed (>= 5 meqabytes/sec.). 

+ It nust be capable of connectina at least 256 
comaouters in a multicomputer complex, 

+ It nust be capable of connecting multiple computer 
tyoese including non-Honeywell mainframes. 

+ It nust be capable of distributing the computers 
over arelatively long distance (>= 1700 meters). 

+ It nust be capable of connecting multiole pnaralletl 
data paths for increased oerformance and 
availability. 


Perinnrerals 


The peripheral configuration supported by the 
mualticomouter system design must be flexible and extensible 
t5 meet the varyine requirements of individual 


iastatllatiors. These requirements include the following? 
- Functional capability (see section 6.5) 


- Avaiability (see section 46.8) 


Sree rs 
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- Perfornance (see section 7.0) 


Three basic types of perioheral connections are required 
t> Suoport these requirements. These connections are not 
tne only alternatives available nor is a multiconputer 
configuration Limited to asingle type of connection. In 
fact, th: types oof peripheral connections used in a 
configuration form an extremely important consideration 
bearing 391 the availahility and performance of the comolex. 


The three types of physical connections are: 


-- Connection af devices to a single computer (non-shared 
physical connection). 


- Conri2zctian of devices to two or more componuters (shared 
physical connection). 


- Conr2zctiaon of devices to the tnter-comnuter connection 
via an adaptor (shared physical connecticn). 


lad 


Siraale Comouter Connection (Non-Shared Physical Connection) 


Fiqur2> 5.3.17 illustrates non-shared physical connection 
of devic2s td a single computer in a multicomputer complex. 
The differenrce between cases Av Bo and C Is the Level of 
availabilty and performance provided through hardware 
redundancy. 


The yon-shared physical connection of peripherals has 
o72@ major disadvantage which will limit its usaqe in some 
jiastatlations. Devices connected in this manner make the 
system susceatible to single point failures. If a computer 
fails, all. of the devices connected only to that computer 
are lost from the entire complex, Furthers, a computer 
witridaut shared ohysically connected devices limits the 
recovery 9f system work stations in that computer. 


Multiole Comouter Connection (Shared Physical Connection) 


Figur2 5.3.2 illustrates shared physical connection of 
devices td two or more comouters in a multicomputer complex. 
The difference between cases A and fF Is the level of 
availability and performance provided throuqh hardware and 
software reduradancy. 


The orimary advantage of the shared physical connection 
is availability. Fiaqure 5.3.2 case B. provides complete 
protectio1 from a single point failure. Tf one of these 
conoauters fails alt of the work of that computer can be 
performed by the other computer (Coossibly with performance 
degradatiojdn). This includes the recovery of system 
workstations. 
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8 second advantage of the shared physical connection is 
oerformaice. If an installation has a sinale large database 
which ca%inot be divided into resource partitionss a sinaqle 
comoauter nay not be sufficient to handle the load. This 
type of cornection provides the neccessary additional power 
at the cost of some additional inter=-computer connection 
traffic for resource management. 


Ss. 5e2 Inter-conputer Connection (Shared Physical Connection) 


Figure 5.35.3 illustrates the shared physical connection 
of devic2s to the inter~=computer connection. This is 
acconolisried through the use of an adaptor for the specific 
device type. This adaptor may vary from non-existent to a 
very cons lex function denending on the inter-~computer 
co77ectid inplementation and the specific device. The 
diffsrenc2 setween cases Av FF and C is the Levelt of 
availability and performance provided through hardware, 
redundancy. Support for this tyoe of connection 1s 
denserdent us01 the existence of market requirements. 


The disadvantages of the physical connection of devices to 
the inter-comouter connection are: 


- Ther? are limitations on the capacity 9 f the 
intes~conputer connections based on cost and 
tecniologys which may limit the useful applications. 


- There (are architectual problems with controlling 
physical access to such devices by multiple operating 
Syst2m. This is complicated when a foreian host is 
configured in a multicomputer system, 


The advartages of the physical connection of devices to the 
riter-conduter connection are: 


- ALL devices are equally accessable by all computers in 
the multicomputer comolex. Thereforesr this provides 
the naxinum possible multicomputer availability.. 


- Sinc2? usage of these devices never requires two 
operating systemss this type of connection incurs the 
least overhead for non-local use of devices. 
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Figure 5.3.1 
PHYSICAL: CONNECTION OF DEVICES TO A SINGLE COMPUTER 
INTER=COMPUTER CONNECTION 
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Figure $.32.2 
CONNEZTION OF DEVICES TO MULTIPLE COMPUTERS 
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Figure 5.3.2 
CONNECTION OF DEVICES TO INTER=COMPUTER CONNECTION 
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A.M Software 


The realization of the full multicomputer canability 
requires yew desians which denart from traditional operatina 
System designs. These designs will involves: 


- Use of DSA concepts and interfaces 1n timolementing the 
multicomoauter capability to provide product 
evolutiorary compatibility and inter product 
comoatibdility. 


- Redesignina system components (such as schedulers 
resource managers etc.) to be system workstations and 
to 182 gqglobat alaorithms in addition to Local 


algorithns. 


- Restructure system startup into steps with each step 
usi1gj functions provided by  oporevious steos. For 
exanola, the Session Layer must be initialized early in 
Startud so that System Work Stations can be initialized 
and communicate during startup. 


- Enharcenents to availability and recovery concepts are 
required. The conplexity and topology of the 
multicomouter confiauration will require automation of 
Operations traditionally left to the onerator. 


&.1 Architecture 


The nulticomouter system desian 1s a specific OSA 


product set which implements the maximum Level of 
cooneratiodn between computer systems tn a DSA network. The 
Level 9 f cooperation is intended to provide end-user 


visidility equivalent to that of a sinaqle computer system. 


The comnurications facility interconnecting computers in 
the multiconputer system desiqn 18 based on the _ DSA 
definitior of a primary network. In order to distinaquish 
the comnunications facilities interconnecting the comouters 
77.030 Oomaulticomputer comp lex from the communications 
facilities through which a multicomputer complex ~*may 
communicate with other computers or other multicomputer 
conalexesr, the multicomputer interconrection is defined as 
the "multicaonputer orimary network". The multicomputer 
orinary  retwork may be a totally independent communications 
network (€see Figure 6.1.1) or a portion of a laraqer 
comnunications network (see Figure 6.4122). 
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The organization of system functions in the 
milticomauter system desiqn is based on the DSA conceot of 
cood2ratirig workstations. Three workstation classes are 
defined for multicomouter based on the confiauration 
attributes of the workstation (1.e. the systems in a 
multicomouter complex In which an occurrence of the 


workstation may be executed). These classes are: 


- Class i workstations - A workstation which may have any 
numoa2r of occurrences in any number of the systems in a 
mualticomouter complex, Apolication workstations are 
typical of this class. 


- Class 2 workstations ~ A workstation which must have 
one aid only one occurrence in each system within a 
multicomauter complex. 


- Class 3 workstations - A workstation which must have 
one and only one active occurrence within a 
multicomouter complex, There may be one or. more 
inactive occurrences within a multicomputer complex one 
of which would become active in case of a failure of 
the currently active occurrence. 


Tt sx ould be r090ted that class 3 workstations imnlies a 
hTerearentt al relationshio between workstations. while this 
may apoear inconsistent with the concept of distributed 
processiijgre the feasibility of imoplementina a totally peer 
orierted yrjgarization requires further study. Thuss for a 
first inplenentation a hierarchical structure must be 
considerad. 


In tre detailed desiqn phase each system function 
performed in a multicomputer complex will he analyzed and 
arouned 11:t> workstations of one the these three classes. A 
conceptual overview of the relationshin between workstations 
17 a multicomouter complex is illustrated in figures 6.1.3 
aid Gel a4 For this illustration the followina 
workstationss their function and classs are defined: 


- Multicomouter Administration Workstation (MCADM) - A 
class 3 workstation which is responsible § for 
admiiristration of the multicomputer complex. Its 
functions include startups availabilitys recoverys and 
configuration. 


- Scheduler/Resource Manaqer Workstation (SCH/R™) = A 
class 3 workstation which 1s responsible for the 
schedulirg of all works, the allocation of all 
resourcess and integrity control for the multi-conputer 
comol ex. 
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Operator Workstation (OPR) = A class 3 workstation 
resooansiogle for handlina the operator interface for the 
muiticomouter complex. 


Multicomouter Availability Workstation  (MCAVL) - A 


class 2 workstation which 1S responsible for 
coorjiiryating the system in which it resides with the 
Multicomouter Administration Workstation. Its 


functions include local system startuns availability, 
and recovery. 


Initiation Workstation CINIT) = A class 2 workstation 
whic 1S responsihle for initiating workstations and 
proc2sses in the system in which it resides. This is 
achi2ved through cooperation with the Schedule/Resource 
Manager wdorkstation. 


Syst2m Inrut Workstation C(SYSTN) - A class 1 
workstation responsible for reading batch job inout and 
inter facing with the SEcheduler/Resource Manager 
Workstation. 


 Syst2m dIJutput Workstation (SYSOUT) - A class 1 
workstation resoonsible for dispersing batch job 
outout.e This is achieved through cooneration with the 
Scheduler/Resource Manaaer Workstation. 
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Figure 6.1.17 
INDEPENDENT MULTI-COMPUTER PRIMARY NETWORK 
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Figure 6.1.2 
SUZBSFT MULTIW-COMPUTER PRIMAPY NETWORK 
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Figure 6.1.3 
SOFTWARE OVERVIEW NOTATION. 


° ‘ Class 1 Workstation 
ee ° Class 2 Workstation 
es _ Class 3 Workstation 
od a SESSION INTERFACE 
meee eens —--- PROCESS AND WORKSTATION INITIATION 


> ARROW = directed arc/edae 


Undirected arcs/edaes imply peer 
relationships hetween workstations, 


Nirected arcs/edagqes imply 
denendent/dominance relationships, 
1.2@. the workstation at the base 
of the “arrow” is the dominant 

and the workstation at the head 

of the "arrow" is the denendent. 
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Figure 6.1.4 
SOFTWARE OVERVIEW 
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6.2 Operating systems. 


hetel 


The software implementations within the mutticomputer 
system d2sign will oermit individual comonuters within a 
multicomoauter complex to be undated to a new release without 
having to update the entire complex and without shutting the 
comalex dowr. This will be accomplished by defining the 


1ater-conzduter protocols for the total multicomouter 
capability sefore the initial implementation so that they 
wilt renain stable from software release to software 


release, Howevers, evolutionary changes tn the multicomputer 
requiremerits nay from time to time require changes in the 
HaSiC. nulticomputer desian. When this happens 
INcdrnpatias,ilities between software releases may occur which 


will require upqrading the software on the entire comptex. 
aEOS 8 


Tre nulticomputer system design supports running 
multiple releases of GCOS 2&8 concurrently within a sinale 
multicomauter comolex. Howevers there may he GCOS & chanaes 
fron release to release which affect the user or oanerator 
Titerfac?2. Depending on the specific change the 
mialticomouter configuration may not he totally transparent 
t> the and user or operator. Fossible areas of visibility 
ticlude: 


- Operator commands 
- Job control lLanquaaqe 
- User/system execution time interfaces 


Ix order to control such visibility and allow more releases 
t> run corcurrentlys the following functions are required: 


- Ther2® will be optional. JCL commands ta specify which 
release of GCOS & is required. If not specifieds the 
site soecified default release for the complex is used 
(the default can be "any release”), 


- The nulticomouter system desian will build on the 


Pistriouted Maintenance Service (DMS) Software 
Envirornent Database facility to identify compatiblity 
hetw2e software components. Thus » it will 


autonatically identify which software releases are 
comoatidle with ae users application and schedule the 
application on the anpropriate computer mn the 
multicomouter comolex. 
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3C9S 3 Support of Other Operatina Systems 


The G20S & imolementation will include OSA protocols to 
Sunpoort coexistence of GCOS 8 and other operating systems 
(sooth HIS and non-HIS) within the multicomputer complex, 
This coexistence will be hased on the DSA Network Batch 
Protocol (see doc. 3.15) the DSA Remote File Access 
Protacol (see doc. 5%. 00 and e617) ¢ and system 
administration protocols. Conformance to these protocols by 
the "foreiqi”™ system will be required for connection in a 
malttcomoauter complex. 


The admiristrative protocols supported by GCOS 8&8 ares: 


- Operator messaqe protocol for system operator input and 
Outduit. 


- 6409S 8 will maintain the status of all activities in 
the nulticomouter system. There will be a protocol for 
activity status update and inquiry. 


- Startus and recovery protocol to coordinate system 
startup and recovery. : 


- Availadility protocol for the exchanae of system status 
Information. 


- Accounting orotocol for exchance of accounting records. 


- allozsation of peripheral devices to non GCOS R 
operating systems for exclusive use. 


The functions based on the SSA Network Batch Protocol and 
DSA Remot2 File Access Protocol include: 


- JCL commands to specify the operating system on which 
activities are to ke executed. 


- trarsf2r of jobs hetween operatina systems. 

- transfer of output between operating systems. 

- traisfer of files between operatina systems. 

- access to GCOS & files and databases through Session 


Control Proarammatic Interfaces to a GCOS 8&8 foreian 
syst2m access control workstations. 
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These protocols are similar to those used between 6COS & 
go0erating Systems; However, due to architectural 
differenc2s odetween GCOS & and other operatina systens the 
fotlowing limitations can be expected. 


- The set of functions which GCOS 2 supports for non-HIs 
syst2ms will be less than for other HIS systems. 


- Thes2? systems may not he able to fully satisfy the 
multicomputer objective for end user transparency. 


- Thes2? systems may not he able to fully satisfy the 
multicaomouter objectives for system availability and 
recovery. 

6.2022! Ither JIS Jperating Systems 
It 158 expected that other HIS -opneratina systems witl 
suoaort trese functions for the concurrent execution of GCOS 
B® ard other HIS onveratina systems in a multicomouter 
comolex. This includes: 
- Multics 


- CP=5 


- Level -54 


Level ~-5 


The nulticomputer functions suonorted by each of these 
o9erating systems will be specified in a separate PFS for 
e3ci onerating system based on the marketina requiremnents 
for the systen. 


Aeleler NOnNgHIS Iperating Systems 
Suoport for non~Honeywell operating systems is Limited 

ts the degree to which the user modifies such systems to 
corform to: 9SA and multicomputer conventions. Any special 
functions required in GCOS &® to support non-HIS operatina 
Systems will se specified in a separate PFS for each such 
system based on the marketina requirements. 

6.3 Resource YWanag2ment 


Resource Management consists of a collection of system 
functions which inctude: ’ 


- Configuration manacement 


- Allocation of peripheral devices 
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- Medi3 RAS newene 

- Allocation of files and databases 

- Access control to files and databases 
- Schedulirg of activities 


The multicomouter system design requires major 
enhancemerits to current resource management in all of these 
areas. I1 siigle computer systems these functions are often 
1ynored 3° conbined. For examonles the allocation of a tape 
drive (d2vic2) and the mounting of a tape reel (media) are 
ofte 1 a single function. 


In the nulticomputer environment the distinction between 
these furstioris nust he well defined and maintained. For 
exanole, a the multicomputer environment it must he 
possible to allocate a tane drive (device) to a conputer 
System wiich will subsequently allocate that tape drive to 
an activity ard mount a tape reel (media), 


The nulticomputer system design provides extremely 
flexible resource Mmanaqement which supports the many new 
kinds of device types and device -usage that will occur. 
This includes: 


- Local. >eripherals - Peripherals restricted to use by 
the comouter or computers to which they are physically 
coniected. 


- Non cocal Peripherals - Peripherals which may be used 
by conputers other than the computer to which they are 
physically connected. 


- NPataoaase machine - The implementation of a. database 
physically connected to one or more computers in the 
comolex which provide storage management workstation 
services to the rest of the comolex. 


-- Storage hierarchy facility - A data access facility 
characterized by data beina transferred between 
physical storage devices of different speeds and costs 
based on the data usage or specific requests. 


-- Device pool - The overall cost of a multicomputer 
configuration can be reduced by having a single pool of 
logically or physically shared devices rather than 
haviyg to confiaure sufficient devices at every 
comauter to handle the maximum resource requirement. 


In order to help tune the performance of a multicomputer 
coafiqurat i131 to snecific site requirementss Resource 
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Management suodports the concept of a “resource oartition™,. 
A resdyurce partition is defined as "a set of physical 
resdurces and the set of all activities which use that 
resdurce set. Furthers these activities do not reference 
airy resdurce outside of the resource partition.” Resource 
partitions car be defined either statically or dynamicalty 
denending or the resource management atlaorithms used. 
Various algorithms will be studied before making the _ final 
design decsission. 


The use of resource partitions Improves oerformance by 
Limiting the scope of operations. Thus: 


- Limitiiyqg access to a set of peripherals to a subset 9 f 
all comduters 1n a complex (possibly to one computer). 


- RPedut:iiqg inter-computer data traffic 
- Limitiyg concurrency control to the resource partition. 
6.3.1 Corfigquratior Management 


The narket requirements for the multicomputer system 
design call for a wide range of confiaurations to satisfy 
the variations in customer annlications. Further, it 78 
exsected trat multicomputer confiaurations will reflect 
evolutiorary changes taking place in the computer industry. 
Thereforas the software desian for multicomputer must be 
oper-ended and flexible. 


- The numoer and type of comnuters which can be 
configured in a multicomputer complex must not be 
arbitrarily Limited by the software. 


- The rumber and type of peripheral devices and their 
topology within the multicomouter complex must not be 
arbitrarily Limited by software. 


-~ In order to provide improved availabilitys the software 
must allow for the complete redundancy of alt hardware 
and software components. 


The nulticomputer system design oaqreatly exoands the 
software and hardware view of peripheral devices. DMevices 
wnich are currently considered as stand alone devices (e.gee 
olottersr page printerss etc.) wilt be handled as standard 
p2risheral devices. In additions evolutionary chanaes witl 
introduce many new devices. Some of the perioheral devices 
ares 


-- disk devices 
- tape devices 


Tr 
used 
tie c 
utati 


The 


The ¢ 
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unit record devices 
archival storaaqe devices 


- olotters 
- ootical document readers 
voc? indut/outout devices 


cage printers 


The nanrer in which these devices can be confiaqured = and 


is flexiole and open-ended so that each site can tailor 
o1fijguration to meet its availability, performances and 
zation r2quirements. 


folloawing device configuration attributes are defined: 


Shared device connection - A device which is physically 
conyected to more than one computer simultaneously. 


Non-shared device connection - A device which is 
ohysically connected to only one computer. 


Local device use - The condition in which a device is 
used dy an activity in the computer to which that 
devize is physically connected. | 


~Non-local device use - The condition in which a device 


7s 39 Sed by an activity in a computer other than the 
comauter to which that device is physically connected. 


Sharad device use - The condition in which a device is 
used by nore than one comonuter simultaneously. This 7s 
indersendent of shared or non-shared connection and 
local: or non-local use, 


exclusive? device use - The cendition in which a device 
is ssed hy only one computer. This is independent of 
shar2d or non-shared connection and local or non-local 
uS@. 


onfijyuration capabilities supported are: 


The device attributes defined above may he assiaqned to 
any pe®ripheral device. Each type of peripheral device 
will have default values based on the device type and 
the type of physical connection. However, the site may 
specify other values through confiquration parameters. 
For 2xanoles while the default values for tapes may be 
focal: use oanlys the site may specify non-local use of 
specific tape drives. 


Any typo? of peripheral device may be physically 


conn2cted as either shared or non-shared. Note, 
diff2rent occurrences of a device type can be connected 
in different ways. | 
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Nevices are assiqned local or non-local use attributes 
based or device types, physical connections and site 
soecified parameters. The non-local use of devices 
will result in additional CPU overhead and 
inter-conputer data traffic. This must be considered 
wher evaluating the benefits of assiqnina the non-local 
use attribute. 


Hariware and software redundancy 1s supported for all 
coma nents necessary to access a peripheral device. 


6.%ec Allocation of Peripheral Devices 


o f 


The allocation of oerioheral devices includes two Levels 
devic:> allocation. Nevices are allocated first to a 


canauter and then to an activity at that computer. The 
functions supdasorted are: 


AlLotation of devices to computers. 


+ Any computer In a multicomputer complex may 
request the resource manager to allocate a device 
to that computer for shared or exclusive use. The 
request will be satisfied hased on the request, 
device connectionse device types, and device use 
atvtrahates. 


+ A local or non-local device 1s allocated based on 
the requests device availibility» and device 
attributes. Allocation attempts to minimize 
non-local device use. 


AltlLoztatian of devices to activities. 


+t Any peripheral device can be allocated to an 
activity for shared or exclusive use based on the 
requests the shared and exclusive use attributes 
of the devices, and how the device is allocated to 
tne computer. The valid combinations of computer 
and activity allocation are: 


| Shared activity !| Exclusive activity 
| allocation | allocation 
seas cca in ies te es ts Stems os | seean <i cscs “cla bia “tl, ag: sls" Sie “ine “i <eants | “din msi sna. cc <item ii ses * ems sa: si “<i 
Shared I i 
computer | yes [ no 


allocation ! | 

we we wren een | en m2 eo -- = -—---- wae | we wn nw ne oe oe 
z=xclusive | | 

conputer | yes | yes 
allocation | | 
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6.3.3 “Media Maragenent 


Media Management is the system component responsible for 
keeo71ng track of all disks tapes and other media volumes 
used on a conouter system, Media are manaaed in terns of 
volune s3ts Cone or more volumes used as a set). Media 
Managemert provides the facilities to support computer 
operation (resource allocation) and media library operation. 
This is azshieved by a Media Mananement maintained catalog of 
volune setse a pool of unassiaqn volumess and a number of 
Media “aragemant interfaces. 


- Media YWanraqement maintains a global media catalog (for 


the multicomputer complex which may be either 
distriouted or centralized. 


- The nedia may be oraqanized into one or nore media 
Libraries. Media Management provides facilities for 
both tenoorary and permanent transfer of media between 
libraries. 


- Atl nedia (€1.@.ee tape reels, disk packs» etc.) are 
manajed as volume sets consisting of one or more 
entities. 


- Atl nenbers of a volume set are always assiqned to 
physical drives with equivalent physical connections 
and device use attributes. (1.eC@.2 the identical 
accessibility by computers). 


4.3.4 &llaocation of Files and Databases 


Pesyurce Management provides all of the functions necessary 
for effective and reliable use of larae databases in the 
mualticomouter confiauration. These functions include: 


- Patarasase partioning 

~ Replicatead files 

-- User initiated file transfers 

- Syst2m inritiated file transfers based on file use and 
$122. 

- Data inategrity control to the files control interval, 
and field levels for the multicomputer complex. 

- File and database tocation independence, 

- File ard database data independence. 

- Audit trails. 

- Before ard after journalization. 

- Pecoyvery after distruption. 


wr 
we ¢] 
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6.27.5 Scisduliig 


The scneduling of activities in the multicomputer 
comolex is a key component in providing confiauration 
trarsparericy. The end user will only accept transparency if 
the scheduler provides efficient use of the computer 
facility. The scheduling of activities and the allocation 
of resourztes are done considering both globat and tlocal 
factors. Tre factors considered include: 


- Noerating system type and release 

- Admirvistrative overrides 

- User overrides 

- Pesdurces required Cincluding processor type) 
- Glooal r2source usaqe 

- Load lLaveling 

- Snecial orocessors 


betede!] FBAtChH Screduling 


The nulticomouter batch scheduling capabilities will be 
based or the ASA Network Batch Protocol and the DSA Remote 
File Access Protocol. The batch schedulinq capabilities 
include: 


- The inodut locations execution locations and odutput 
location are all independent of each other unless 
soecified by the user, 


* Activitie?s within a job are automatically directed to a 
comouter for execution considerina the above factors.e 


- Activities within a job may execute in separate 
compatible computers with the same operating system, 


- Qutout 18 automatically directed to the apnpronoriate 
locat:ion. 


6.2.5.2 Timesharing Scheduling 


The nulticomputer timesharinq capabilities will be 
erharced to provide the following: 


~- A tinesharitng user can request timesharing services on 
a sd2ecific computer in the multicomputer complex. 


- A tinesharing user can reauest timesharina as a qeneric 
service. The user's session will he scheduled on a 
comauter based on: | 


t+ which computers are executing timesharing 
# the user resource requirements (Cuser profile) 
+ tne loading on the various computers 
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In trIs situation the computer which handles the user 
sessior 1s transparent to the user, 


6.3.5.% Trarsaztioaa Processing Scheduling 


The nulticomputer transaction processing facilities 
will he erhanced to provide the following capabilities: 


- A transaction processing user can request transaction 
proc2ssiig services on a specific computer in the 
multicomouter comolex. 


- A transaction processing user can request transaction 
proc2ssiv%g aS a qeneric service. The user's session 
will ba scheduled on a computer based oan: 


+ which comouters are executina transaction 
processing 

+ the user resource requirements (Cuser orofile) 

+t tre loading on the various computers 


In this Situation the computer which handles the user 
$essi01 1s transnoarent to the user, 


4.4 Files and )ataoases 


The dajectives of the Distributed System Architecture 
and the nulticomputer system design ares: 


- The nulticonputer complex appears to the end user toa be 
a siigle computer system, 


- The nulticomputer comolex conf iquration 1s typically 
trarsparent to the end user. 


~ Att batches timesharings, and transaction processing 
aoplications have a sinale end user view of the files 
and dataoases which they use. 


- Fase of use for the applications programmer, 


- [Isolation of the user from details of the datahase 
desijn and the steps required to retrieve data. This 
allows: 


+ The database to he changed without requiring 
application nroarams to chanqe. 


+t Inter-computer database communication between 
macrines of different personalities Ce.seq.r Lé%, 
L5&e L64, L&ése etc.). 
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The full realization of these capabilities can only be 
achieved by evolution to the news highty structured and 
staydardizeds DSA data storanae and retrieval hierarchy. The 
DSA storage and retrieval hierarchy consists of the 
following layerss 


- Nata independent access 
- Rata dependent access 
= Record arid file access 
7 Storage access 
- Nevice access 
-— Physical access 


The DSA storage and retrieval hierarchy is a tlonq term 
developmerit. Therefores, a short term solution to the file 
ard database architecture must be nrovided in multicomputer 
for the following reasons: 


- Syst2m components and anolications require a duolicate 
file capability to provide additional availability. In 
order tt) provide maximum availability these files must 
be ascessible from multiole computers. 


- Curr2nt apolications must he accommodated in ” the 
multicomouter environnent,. 


The $208 8 Buffer Manager oerforms functions similar to 
tne storage access layer of the DSA storaae and retrieval 
hierarchy. Therefore, the Puffer Manager will provide the 
following storage access layer functions (see Fiqure 6.4.1): 


- The suff2r Manaaer functions as the system-wide space 
manager for control interval buffers for all files. 
Its prinary function is the bhalancina of space 
requirements for "main memory” file buffer space. As 
sucne it works as a software controlled disc buffer 
att2nptirig to reduce reat I/0 by maintaining frequently 
acc2?ssed data blocks 1n main memory. 


- The 3uffer Mananer is responsible for providing a 
duplicate file capability. When the duplicate file 
option is specified, Suffer Manaaer writes every update 
to trie orimary file and the duplicate. If there 1s a 
failure involving the orimary files Ruffer Manaacer 
inter facina with File Management automatically switches 
to tre dupticate file for all operations. In addition, 
Puffer Manager can read from either copy based on 
device usaqe. | 


- The 3uffer “Manager also provides an inter~-computer DSA 
inter face to accesses at the control interval level, 
files located at remote computers. 


Multidccomputer §8052243 37 1 
Global PFS DRAFT 


Figure 6.4.1 
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46.5 Oo2rator Literface 


The Joerator “essaqe Manager must provide’ functions 
whic? allow the multicomouter complex to be manaqed as a 
single systen,. Since the topoltoay of the multi-system 
comolex 38s siaqnificantly different than a sinate comouter, 
this imposes a number of requirements on the Operator 
Terninal Yarager. | 


- ALL dXperator messaqes must be assigned td messaqe 
catejyories which are based on the type of operator 
function. These categories should include: 


tape library requests 

tap2® mounting 

urit record actions 

general information (job starts ends etc.) 
jod scheduling management 

configuration manaqement 

et Ce 


+r r+ ee 


- Tt nust be possible to specify operator functions 
(1.2. nessage categories) for each operator terminal. 
Not?» the functions of several operator terminals may 
overlaod. 


- Fach ooerator terminal must te able to restrict its 
functions by configuration attributes Ce.g.e, all tape 
mourt nessages for a specified list of tape drives). 


- Suda ort an unlimited number of operator terminals. 


- Supoort operator terminals connected via data 
comnunications Lines. 


4.6 Startuo 


The nanier in which operating system startup occurs must 
be restructured into a number of steps with each step using 
functions provided by previous steps. For examole,r the 
S2ssion  cayer must be tnitialized early itn startup so that 
Systean Work Stations can be initialized and communicate 
duriya startud. 


The conceost of a software configuration must be added to 


startud to support the initiation of specific system 
workstations according to software configuration oarameters. 
There ar? a number of system workstations for which 


9ccurrenc2s of the workstation do not man one to one with 
conoxuters in a multicomputer confiauration. 
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The startup procedures used in the multicomputer system 
must be corsistent with the view of a single comouting 
comolex. Thuss the startup of = any Individual computer 
Tavaoalves an exchange of confiauration and status information 
wit? the rest of the complex. The startup of each conpnuter 
must be done with the cooperation of that portion of the 
multicomouter complex which is already started. The 
following functions are required: 


- Startuo of all computers in a multicomputer complex, 
excedyt the first computer to be starteds must require 
mininal operator involvement so that consistency of 
startud parameters across the multicomputer complex 15 
assured. 


- In addition to the standard local boot canabilitys an 
externally initiated (remote) boot capability must be 
provided which oermits the executina ovoortion of a 
multiconmaouter comolex to boot and coordinate startup of 
all o»ther computers 13 the comolex. 


- A faiking comouter must be able to automatically 
restart with the cooperation of the non-failing portion 
of tie complex. 


~ It nust ae possible to startup a multicomouter comrlex 
even trough one or more of the computers in the comolex 
may se uravailable, . 


A.7 Availability and Recovery 


The availability and recovery aspects of a multicomputer 
comolex nust be based on the specific user requirements. 
The user nust consider the following factors in determining 
his specific configuration: 


- The tyoes of applications to be supported. Zatch, 
timesharinas and transaction processina, 


- The Level of availability that is needed (utility 
qrad2, etc.) 


- Perfornance requirements in terms of numbers of users, 
response time, throuqhputs, etc. 


- Qveratll nrardware costs. 


The site must be able to confiaure both the hardware and 
software td meet its specific needs. The following 
functions are required for maximum availibility and recovery 
(These must be subsetable to meet the sites needs): 
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- The failure of any single entity (computers channel, 
devizres aperator terminals, software components etc.) in 
the nulticomouter complex must not cause the failure of 
the whole multicomputer complex. The capabilities 
required include: 


+ Use of duplicate fite functions for critical 
system files. 


+ Detection of a computer failure by other computers 
19 the multicomputer comolex through inter 
comouter availability protocols. 


+ The work of system workstations wilt either be 


assumed by a system workstation in another 
comoauter or the System workstation will be 
restarted in another computer from the last 


checkpoint. 


+ Tie multicomputer confiquration will be undated 
and adjustments made to schedulina oaraneters. 


+ User workstatiors which use system provided 
recovery capabilities witl be restarted in another 
comouter. 


+ Confiquratian of redundant hardware  CIOM, 
cartrollers channels» etc.). This includes shared 
physical connection. of devices for optimum 


performance. 


- Wher any new hardware, includina a new conputers is. 
installed it may he added to the multicomputer comolex 
without shutting the comolex down and restarting. 


Recovery in a multicomputer configuration will he 
conolex. Thuse the system nust perform many operations which 
are curr2antly performed by operators. In situations where 
operator intervention is reauireds, the system must provide 
orecise and easily understood interfaces. The functions 
required includes 


-- In order to reduce the coordination problem associated 
with recovering from journals, there must he: 


+ A single after journal oper file (with the 
capability for multiple conies). 

+t A sinale before journal per process (with the 
capability for multiple copies). 
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- Pestart of a failed computer in a multicomputer complex 
must require minimal operator involvement so that 
restart darameters are consistent with the non-failed 
memo2rs of the multicomputer complex. 


A.8 Statistical Data Collection 


Statistical data in the multicomputer complex must be 
consistent with the concept that the configuration 15 
totally transparent to the customer. Therefore, user 
hitling nust be independent of the processor and peripheral 
devices s2lected by the multicomputer resource manager. For 
exanole, 2PJ seconds alone is not a vatid unit of resource 
usages C?U seconds must be adjusted by an anorsdpriate CPY 
sseed factor. A second solution would he to have the 
hardware tount memory references. 


Statistical data records on this file wilt contain 
1Vformation which uniquely identifies the resource within 
the tmulticsconputer confiquration. For examole, the record 
for the execution of an activity will identify on which 
conouter the activity executed. 


A.9 Distributed Yaintenance Services (DMS) 


The Distributed Maintenance Services concepts must be 
exterded to include the multicomputer complex. Areas to be 
extenrded includes: 


- The 2rror logging facility in the multicomputer comolex 
must be nanaqed at the complex level. Information on 
the error log must include untque identifications of 
all failing components. Tn multiconputer 
configurations computers must be handled Like any other 
device. : 


- Remote maintenance services must be able to access the 
multicomouter complex as if it were a single system, 


- The renote maintenance service must be able to diagnose 
haridvare and software problems within a computer in the 
multicomouter complex. 


- The renote maintenance service must be able to access 
the 2rror loa when any one of the computers within the 
comolex has failed. 
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6.19 Subexecutives 


There is a eSsianificant amount of user visible 
functionality provided by various subexecutives. The 
trarvsaction processing CITP) and timesharing (TSS) 


subexecutives are of particular importance. In order for 
the multicomouter architecture to truly meet the user 
requiremeits., these subexecutives must be extended to 
ercomoass the multicomouter architectural objects. 


Those areas which must be extended includes: 


- The sudexecutives must he enhanced to interface with 
the data independent access layer of the data storage 
and retrieval hierarchy to achieve file and database 
location independence. 


- The subexecutives must be enhanced to optionally 
orovide Load tlevelina capahilities of subexecutive 
users across the multicomputer complex. This can be 


done at var?ous times includina loa tins, transaction 
initiation or periodic rebalancing. 


- A conputer failure within a multicomputer complex must 
b2 trarasparent to the subexecutive users on other 
comouters and cause minimum disruotion to users an the 
failing computer. 


- The nulticomputer configuration must be transparent to 
the susexecutive users. 


The fFurctions supported by each of the GCos 8 
subexecutives wilt be specified in a senarate PFS for each 
Malticomouter release based on the market requirements for 
tnat release. | | 
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7. Multiconputer Performance 


The actual operformance of any given multiconputer 
comolex will deonend heavily on the hardware confiaquration 
and software sptions chosen. The orimary factors that will 
affect perfornance are: 


- The specific availability options chosen, 
- The 31ze and distribution of the user database. 
- The rate and distribution of database usaqe. 


- The nanner in which nerioheral devices are confiqured,. 
The numder of devices», whether devices are shared 
betw2er computerss etc. 


In 9*0der to achieve optimum performance “for each 
customer, the multicomputer system design must include the 
tools rec2sSsary to accurately measure user performance = and 
provide tha information needed for better configuration 
managenerit. Tools for distributing (nartioning) the user 
database will be particularly important in achievina ootimum 
performnarze. 


Multicomputer 58053243 44 1 
Global PFS DRAFT 


8.0 Standards 
Software releases for the multicomputer system design 

conforms to HIS standards apnolicable to software products as 
required sy Honeywell Poticy HIS-8- Product line Unificatian 
ard Stanjardizations and Distributed Systems Architecture. 
Anolicabl2 standards includes 

- PSA Reference Rook - Volume 1 (General Description). 

~ HONA Link Protocols NT-51 

- HONA Network Protocols NT-52 

- HDN4 Transport Multiplex Protocol, NT-453 

- DSA Connection Protocols ASA=5G4 

-— DSA Dialog Protocols, DSA=-55 

- DSA 2resantation Control Protocols, NSA-56 

- DSA Vetwork Batch Protocols, DSA-60 


- Remote File Access Protocols, SSA-47 


- Remote File Access Protocol Logical File Transfer 
Suos2tse ISA-62 


2.1 BDesi314 Stardards 
The Multicomputer desiqn will be a ton down desian usina 
Wellmade stardards. This will result in several important 
nenefitss: | 
- Improved design documentation 
- Improved productivity of desianers and developers 
- Improved product reliability 


- Improved product maintainability 


- Modularity and reduced intermodule dependence 
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Rl? Imalenentation Standards 


Att imolementation wilt be done in ae high Level 


inolementatior languaae wherever feasible. This wilt result 
11 several imogortant benefits: 


may 


Improved productivity of developers 


Inproved product reliability 


Improved product maintainability 


Modularity and reduced intermodule dependence 


Standard interfaces 


The inplenentation of the multicomputer system desiqn 


result in increased system overhead, In order to 


prevent arty such overhead from becoming intolerables, the 
following measures will be takens 


Within DSA Implementation standards, every possible 
measur2 must be used to prevent unnecessary overhead. 
Thes> standards may be modified after implementation, 
if 1istrumentation shows it ts needed. 


The impact of the multicomputer implementation on 
single computer confiqurations must be minimized, 


The imolementation must include the metering necessary 
to neasure and correct any performance problems that 
may occur. This should be done in a manner that 
pernits easy removal for production systems (e.aes via 
macrads). 


The increased overhead required to support the 


malticomouter complex will be compensated by the following: 


Qverread must be compensated for by better utilization 
of r2sources at the global level. 


Overrvead must be comnensated for by Imoroved 
availapility of the system. That iss less down time, 
less time? spent recoverina from failuress etc. 
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9.9 Docunentatior 


Pocun2ntation must be provided for all the prospective 
users of trie the multicomouter system, The Level of detail 
ranqyes from concept documents for orospects and end-users 
through detailed reference manuals necessary to start and 
oonerate a Miulticomputer Complex. 


Nhile initial documentation may o f necessity be 
conventisiral hard copys full coqnizance should be taken of 
the advantages of on-line documentations either via terminal 
disolay or usina microfiche readers and the like. The need 
for docunertation mobility within a multicomputer system 
will be at’ least as great as the need for anoplication or 
database modility. This need will impact hoth the document 
distributior costs to deliver the documentation to the user 
and the costs to transfer the documentation within the user 
System e2rIViI ronment. The following sections define the 
vartous categories of documentation required. 


9.7 End-JUser Documentation 


End users as defined here include casual users of the 
systense, ADN-“cImMputer professtonals and interested observers. 
Docunentation required for this class must emphasize the 
hasic conzsepts employed tonaether with the mechanics of 
iyteractiig with the system (e.qaer siqn=ons application 
activatiois, file accesss communication with another end 


user). Yocunentation at this tlevet will not describe 
inolementation or detailed tlanquaae ODtIONS. On-Line 
docunentation will emphasize a "first aid™ canabilty, 
usually i1 response to a “help” command. Helo must be 


provided 12%: both training (for first time users) and 
diagnostiz (for unexpected conditions) mode. 


- A MWMualticomputer Concepts and Facilities Manual is 


required and must be available at the time of 
annoduncenent of the first release. This document 
describes both the system desian and how the 
multicomouter capabilities can be used. The document 


should indicate the kinds of multicomputer complexes 
that can be confiqured and how they would be used to 
Supod rt a customer's business needs. 
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9.2 Apolicatiay Development Documentation 


Anoplization development is primarily concerned with the 
jnolementatior of new procedures. Documentation required 
for this cslass of user must include reference documentation 
91 the various orocedure languages supported as well as 
iastructions for Linking new procedures into the existing 
avolicatioyn structure. Amona the most important asnects of 
this intejration phase are: 


- How td establish a communication session In a 
multicomouter system. 


- How to send and receive messages within a communication 
session. 


- How to iriterface to the database facility. 


How to 11teract with other tasks and proarams. 
°.% Multicomputer Administration and Control Documentation 


The nulticomouter administrator controls the entire 
comolexs r1e1c9# this area demands the richest and most varied 
docunentat or. Basically» six cateaqories of documentation 
are required: 


- Comolex configuration and startup 

- Dataoase distribution and definition 
- Catalog and Media manaqement 

- Access control and security 


- Operation of the complex Cincludinq monitoring and 
tuniirg). 


-- Configuration management (Creconfiguration) 


Within 2ach Oo f these categoriess two classes of 
documentation are required: The formalism for accomnlishina 
the objective and the assistance in makinq the decisions 
tnenselves. Here aaqain an on-line capability is hiqhly 
desirable because of the opportunities for linking decision 
considerations with on-line system performance monitorina 
tools. 


10 
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Glossary ( For OSA definitions see the DSA Reference Book - 
Apoendix A » DSA Terminology ) 


Activity - An activity is the basic unit of work which is 
scheduled. Some examples of activities includes: 


+ A jad step Cbhatch) 
+ A sessior Ctimesharing) 
+ A transaction (transaction processina) 


Arciival storage device - A mass storace device intended for 
infrequently accessed files. Typically such devices offer 
low coster high capacity storaae at the expense 9 f 
accessibility. Due to the access time characteristics of 
theses devices, files are often staned to hiah speed devices 
durina periods of file activity. 


Buffer Maragenent - A software component in GCOS 8» similar 
t> the 2SA storage access tayer, which functions as a 
System-wide soace manager for data files. Buffer Manaqement 
functions as a software controlled disk buffer - attempting 
to reduc2 real I/0 by maintaining frequently accessed data 
blocks resideit in main memory. 

Concurrensy Control = See data intearity control. 

Conouter System - An information processing system 
consisting 29f a single computer and operating system 
suoaartiyg apolication programs and datahases. 

Nata deperident access Layer ~- A PSA term, 

Nata iIndersyendence ~- A ASA term. 

Data inderserident access layer - A OSA term. 

Data integrity control ~ A OSA term, 

Data Storage and Retrieval Hierarchy ~ A DSA term 

Database - A collection of files of stored records where 
each fil2 is constructed in accordance with ae sinate 
storaqge-oriarited data description. 

Patabase Processor - A functionally dedicated computer 


respansisle for the functions of the storage management 
workstation. 


Device acsess layer ~- A DSA term. 
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Dise Cacie 3uffer - A buffer, transparent to central 
processo*: softwares which is used to reduce real disc I/0 by 
maiatainiryg frequently accessed data blocks ina dedicated 
seoarate nenory. The cache would be tocated close to the 
discre oossiolty in the controller or channel. 


Exd-User - A DSA term, 


Functionally Jistributed - In a functionally distributed 
Syst2m aztivities are searenqated based on their type. A 
database processor is one example of a functionally 
sapecializ2d processor. Other types o f functional 
sosecializatiory include higher level language processors such 
as COROL or FORTPAN machiness and special editing or text 
ore d eration orocessors,. Specialized hardwares, firmware and 
software syst2ms can be employed in a specialized processor, 
suc as a firnware interpreter in a directly executable 
higher Lavel language processors or it can simply involve 
the furctioral dedication of conventional systems. For 
exanol®s, a database machine can be confiqured from standard 
hardware and software oroducts. 


Inter-Conduter Connection - A direct physical link between 
eac} conduter in a multicomputer complex and every other 
comouter in the complex. 


Layer ~- A DSA term, 


Multicomouter Complex - A Multicomputer Comnlex consists of 
two or nore inter-connected computer systems and sufficient 
software to coordinate resources and oprocessina activities 
at the nulticomouter level. The complex typically will be 
Lacally distributed and will present a sinaqle end user view 
of thie system. 


Physical access layer - A OSA term, 

Record ani file access layer ~- A DSA term, 

Session Control Programmatic Interface - A NSA term. 

Session Layer ~ A DSA term. 

Shared Logical Connection - With a shared loaical connection 
multiple hosts can access a single device. This device 
miaht be dshysically connected to only one computers or may 
b2 ohysicaltly shared by multiple computers. If physically 


connected to only one computers, that computer will access 
the devic2 for other hosts itn the complex. 
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Shared Physical Connection = A shared physical connection 
requires direct ohysicat oaths between an I/0 device and 
multiple rosts. This does not imply that the device in part 
or 179 whole will be used simultaneously by more than one 
host. 


Storaae Access Layer ~- A DSA term. 
Storage Management Workstation - A DSA term. 


System Work Stations - System workstations are workstations 
whict Sudport system functions. The user visibility of 
these worxstations is equivalent to that of conventional 
ooerating system processes. System workstations in the 
multicomoauter environments will provide functions including 
Subdsysten naragerss availability/recovery functionss and 
scheduliyj. 


